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Description 

NOVEL POLYPEPTIDE S AND DNAs ENCODING THEM 

the iw p^y% n pt^r tiOT ^ rel3t ^ 

The cells which make up a brain may be largely grouped into neuron and glial cells. Neurons play a major role in 
the transmission and treatment of information in a brain. <a JO(l o iei n 

^TJ^T hand ' 9 !' al Ce " S SUPP ° rt the aC,k)n ° f n8UronS SP^Iftally 9lial cells are associated with protection 
IT/^T, T 0 " 8 ' ,ma,IOn °' mye ' in Shea,h ' ,0rrna,ion of bl00d brain barrier - *"very of nutrition to neurons 
ne^l " eUro,ransmi,ters ^ are furth *' ^sociated with the control of proliferation and differentiation of 

»Z Z r " " r^" 9, 6tC - Th8re 8re VarbUS kind8 °' 9lial ce,,s includi "9 astroc y^- oligodendrocytes 

£2"; the centra, ne^ous system, and Schwann cells and mantle cells in the peripheral nervous system 
Ependymal cells existing in the cerebral ventricular endothelium are also glial cells 

fonnH 80 ?* man f y th humoral ,ac,ors « or the differentiation, proliferation and growth of neurons and glial, have been 
found, and most of them are generated by glial or tumorous glial cells (referred as glia etc. as a whole hereafter) 
w J 0 !^?/.!' 9r0Wth ' ac,or < NGF - m w 13 5 kd (human)), brainderived neurotrophic factor (BDNF. m 
w ca. 13.5 kd (human)), neuroblastoma growth inhibitory factor (NGIF, m.w. ca. 5 kd (human)) and glia-derived neu- 

° f (m , W ? ^ ^ * 110 ^ (b ° ,h fatS)) ' a " are kn ° Wn as ^s acting' on neuron, are 
fnn m , T Furt "er, glia-derived glial growth inhibitory factor (GdGGIF. m.w. more than 100 kd (rats)) 

2^52^ growth factor (PDGF, m.w. ca. 32 kd (human)), both acting on glia, are also generated from as- 
trocytes. Furthermore. gl.adenved neurotrophic factor (GdNTR m.w. ca. 43 kd (rats)) exhibiting a neurotrophic activity 
and a prolrfera ,ng activrty on glia, glia maturation factor (GMF. m.w. ca. 16.5 kd (bovine)) exhiMing a neuroUoph* 
actrvrty and a stimulatory activity on proliferation and differentiation of glioblastoma! cilially neurotrophic factor (CNTF 

, (rabbrt l and ra,s )). exhibitin 9 an act'vrty of life maintenance of ganglion cells, an inhibitory activity of 
prolrferation of sympathetic ganghon cells, a stimulatory activity of differentiation of sympathetic ganglion cells and 

of neurons and astrocytes, all are generated from astrocytes. Schwann celWerived neuron growth factor (SchNTF m 

FN7YMP ^ } ^ neUr ° n " 9 enera,ed ,rom Schwan " "lis (see PROTEIN, NUCLEIC ACID AND 

ENZYME, vol.36 (No. 7), p.260, 1991 ,n detail). It has also been found that interleukines (IL-1, IL-6 etc ) which are 
believed to be mainly secreted by immunocytes are also secreted from glial 

The amino acid sequences of most of these factors have been revealed. They are of interest not onfc for pure 
research but also for the research and development of pharmaceuticals. P 

The present inventors have sought to find novel factors (polypeptides) which glia etc generate 

an in?pnTHr. al me ! h0dS ,0 ° btain 3 Parti ° Ular P°'^ e P tide or a °NA encoding it, generally utilize methods in which 
anrniT hiologicalactivrty is confirmed in a tissue or in a cell medium, the active polypeptide is isolated and purified 

blotogicalaS^ P ° fw>eptWe 18 Cloned ' or utilise methods of ^pression^oning with the guidance of the 

However, physiologically active polypeptides in the living body have often many different kinds of activity Therefore 
. is increasingly found that after a gene is cloned, the gene is identical to one encoding a polypeptide aJ^K? 
Abo gl,a generates only a very small amount of most factors and this makes it difficult to isolate and to purify a factor 
and to confirm its biological activity. 

More recently sequencing technology has been.rapidly developed, and methods based on "Reverse Genetics" 
for characterizing the function of a gene from the sequence of the gene, have also been greatly developed 
Th * presentment 

y ? n^T ' r0m 9113 6tC ' ° b,ainin9 CDNA by usi "9 mRNA ,hus e>"oidatin 9 £ nt 

deot.de sequence of a cDNA and deducing the corresponding amino acid sequence. DNA thus obtained was trans- 
formed into a mammalian host cell to express a novel polypeptide as recombinant protein which was then purified 
»c 17° h? T^?? Pr0,ei " W3S carried 0Ut ^ the sec ' uence has been confi ™d to be tfentical 
Lp22 T % mS C l NA - FUrther ' me amin ° 3CkJ Sequence of ,he N - ,erminal end °' « s ™*™ P^tein has been 

S^J^"™ ™^ inVen, ° rS h3Ve SUCCeed9d in iden,ifiyin 9 a P revious| V ""identified secretab.e 

polypeptide and DNA encoding it, to complete the present invention 

.h»^ e n!n«H S n ° P ° lyPeP, " e haVin9 30 amin ° acW Sequence M is kJen,ical t0 - or ha s high homology to, that of 
he polypept.de of the present invention, when the amino acid sequence of me polypeptide was compared by a computer 

l^r n JT en ?r ? SWiSS Pr0t (SwiSS Pr ° l Release *»■ F^ermore, there was no nucleotide se- 

rZtnn ,S ,0 ' ° f "J!? hi9h h0m ° l09y ,0 ' ,hat enCOdi " 9 me t**™™* of P' ese " '"ventton. when 

Therefore, the polypeptide of the present invention is considered to be quite novel. 
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Brief description of the drawings 

Figure 1 shows the construction of plasmid vector, pVICS-1. 

Figure 2 shows the construction of a recombinant DNA into which cDNA derived from human glioblastoma cell 
line T98G is inserted. 

Figure 3 shows a hydrophobicity profile of the polypeptide of the present invention. 
Figure 4 shows the construction of an expression vector, pUC-SRctML1 . 
Figure 5 is a photograph showing patterns of SDS-PAGE in Example 10. 

The present invention thus provides a polypeptide having the amino acid sequence shown in Seq ID No 1 or 3 
in substantially purified form, homologues thereof having at least 90% homology with the amino acid sequence shown 
in SEQ ID No. 1 or 3 over a region of at least 30 contiguous amino acids, or a fragment of the sequence, or a fragment 
of such a homologue comprising at least 20 amino acids. The invention further provides a DNA coding for such a 
polypeptide. A further embodiment of the invention provides a DNA having a nucleotide sequence shown in SEQ ID 
No. 1, 3 or 5 and fragments thereof comprising at least 20 nucleotides capable of selectively hybridizing to SEQ ID 
15 No. 1, 3 or 5. 

The present invention in particular relates to: 

(1) a polypeptide having an amino acid sequence shown in SEQ ID NO. 1, 

(2) a DNA encoding the polypeptide described above (1 ), 

(3) a DNA having a nucleotide sequence shown in SEQ ID NO. 1 , 

(4) a polypeptide having an amino acid sequence shown in SEQ ID NO. 3, 

(5) a DNA encoding the polypeptide described above (4), 

(6) a DNA having a nucleotide sequence shown in SEQ ID NO.3, and 

(7) a DNA having a nucleotide sequence shown in SEQ ID NO.5. 
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A polypeptide of Seq. ID No. 1 or 3 in substantially purified form will generally comprise the polypeptide in a 
preparation in which more than 90%, eg. 95%, 98% or 99% of the polypeptide in the preparation is that of the Sea ID 
No. 1 or 3. ^" 

A polypeptide homologue of the Seq. ID No. 1 or 3 will be generally at least 90% and preferably at least 95% 
homologous to the polypeptide of Seq. ID No. 1 or 3 over a region of at least 30, preferably at least 40, for instance 
60 or 100 contiguous amino acids. Such polypeptide homologues will be referred to below as a polypeptide accordina 
to the invention. y 

Fragments of Seq. ID No. 1 or 3 or its homologues will be at least 20, for example 25, 30, 40, 50 or 60 amino acids 
in length, and are also encompassed by the term "a polypeptide according to the invention" as used herein. Particular 
fragments of the polypeptides of the invention are fragments of Seq. ID. No. 3 which include Seq ID No 1 or a 
homologue thereof. 

A DNA capable of selectively hybridizing to the DNA of Seq. I D No. 1 , 3 or 5 will be generally at least 70% preferably 
at least 80 or 90% and more preferably at least 95% homologous to the DNA of Seq. ID No. 1 , 3 or 5 over a region of 
at least 20, preferably at least 30. for instance 40, 60 or 100 or more contiguous nucleotides. Such DNA will be en- 
compassed by the term "DNA according to the invention". 

DNA according to the invention may be used to produce a primer, eg. a PCR primer, a probe eg. labelled by 
conventional means using radioactive or non-radioactive labels, or the DNA may be cloned into a vector. Such primers 
probes and other fragments of the DNA of Seq. ID No. 1 , 3 or 5 will be at least 20, for example 25, 30 or 40 nucleotides 
in length, and are also encompassed by the term 'DNA according to the invention 1 as used herein. 

DNA according to the invention may be produced recombinantly, synthetically, or by any means available to those 
of skill in the art. 

A further embodiment of the invention provides replication and expression vectors comprising DNA according to 
the invention. The vectors may be, for example, plasmid, virus or phage vectors provided with an origin of replication 
optionally a promoter for the expression of the said DNA and optionally a regulator of the promotor The vector may 
so contain one or more selectable marker genes, for example an ampicillin resistance gene. The vector may be used in 
vitro, for example for the production of RNA corresponding to the DNA, or used to transform a host cell. ~ 
A further embodiment of the invention provides host cells transformed or transfected with the vectors for the rep- 
lication and expression of DNA according to the invention, including the DNA Seq. ID No. 1 , 3 or 5 or the open reading 
frame thereof. The cells will be chosen to be compatible with the vector and may for example be bacterial, yeast insect 
55 or mammalian. 

DNA according to the invention may also be inserted into the vectors described above in an antisense orientation 
in order to provide for the production of antisense RNA (DNA). Antisense RNA (DNA) may also be produced by synthetic 
means. Such antisense RNA (DNA) may be used in a method of controlling the levels of a polypeptide of the invention 
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in a celt. 

A further embodiment of the invention provides a method of producing a polypeptide which comprises culturing 
host cells of the present invention under conditions effective to express a polypeptide of the invention. Preferably, in 
addition, such a method is carried out under conditions in which the polypeptide of the invention is expressed and then 
secreted from the host cells. 

The invention also provides monoclonal or polyclonal antibodies to a polypeptide according to the invention. The 
invention further provides a process for the production of monoclonal or polyclonal antibodies to the polypeptides of 
the invention. Monoclonal antibodies may be prepared by conventional hybridoma technology using a polypeptide of 
the invention or a fragment thereof, as an immunogen. Polyclonal antibodies may also be prepared by conventional 
means which comprise inoculating a host animal, for example a rat or a rabbit, with a polypeptide of the invention and 
recovering immune serum. 

The present invention also provides pharmaceutical compositions containing a polypeptide of the invention, or an 
antibody thereof, in association with a pharmaceutical ly acceptable diluent or carrier. 

The invention also provides a polypeptide according to the invention or an antibody for use in a method of therapy 
or diagnosis on a human or animal body. 

It is known that most cytokines and growth factors, as well as a number of membrane proteins, have a signal 
peptide on their N-terminL The signal peptide is a highly hydrophobic region immediately downstream of the translation 
initiation amino acid Met. It was confirmed that the polypeptide of the present invention has a signal peptide which is 
located from methionine (Met) at the 1 st position to serine (Ser) at the 23rd position in the amino acid sequence shown 
in SEQ ID NO. 3 (see Example 1 1 ). The essential sequence for biological activity (the mature protein sequence) is the 
amino acid sequence where the signal peptide is removed from the amino acid sequence shown in SEQ ID No. 3 i.e. 
the sequence corresponding to SEQ ID No. 1. The signal peptide is not required for biological activity. 

The polypeptides of the present invention include fragments in which.part of the amino acid sequence is lacking 
(e.g., a polypeptide comprised of only essential sequence for revealing a biological acitivity in a mature protein), homo- 
logues in which a part of the amino acid sequence is replaced by other amino acids (e.g., those replaced by an amino 
acid having a similar property) and homologues in which other amino acids are added or inserted into a part of their 
amino acid sequence, as well as those having the exact amino acid sequence shown in SEQ ID Nos. 1 and 3. 

As is known well, there are from one to six different codons which can encode each amino acid (for example, one 
kind of codon for Met, and six kinds of codon for leucine (Leu) are known). Accordingly, the nucleotide sequence of 
DNA can be changed in order to encode the polypeptide having the same amino acid sequence. 

The DNA's of the present invention, specified in (2) and (5) include every nucleotide sequence encoding polypep- 
tides shown in SEQ ID NOs. 1 and 3, as well as fragments and homologues thereof. There is a probability of improving 
a yield of production of a polypeptide by changing a nucleotide sequence. 

The DNAs specified in (3) SEQ ID No.1 and (6) SEQ ID No. 3 are the specified embodiments which have been 
isolated as the natural DNA's encoding the polypeptide of SEQ ID Nos. 1 and 3. 

The DNA (7) shown in SEQ ID No. 5 is the sequence of the DNA specified in (6) with a non-trans lational region. 

The DNA having a nucleotide sequence shown in SEQ ID NO. 5 may be prepared according to the following 
methods, that is, 

(i) by isolating mRNA from a cell line which secretes the polypeptide of the present invention (e.g., human gliob- 
lastoma cell line), 

(ii) by preparing first strand (single stranded DNA) from mRNA thus obtained, followed by preparing second strand 
(double stranded DNA) (synthesis of cDNA), 

(iii) by inserting cDNA thus obtained into a proper plasmid vector, 

(iv) by transforming host cells with the recombinant DNA thus obtained (preparation of cDNA library), 

(v) by random-cloning on a large scale from cDNA library thus obtained, followed by sequencing about 300 bases 
from 5' end of each clone, and 

(vi) by sequencing complete length of a clone which has a novel base sequence. 

Explained in detail, step (i) may be carried out in accordance with the method of Okayama, H. era/, (described in 
Methods in Enzymology, vol. 154, p 3, 1987) after a human glioblastoma cell line is stimulated by a proper stimulant 
(e.g., IL-1 etc.). Examples of the cells which secrete the polypeptide of the present invention is preferably human 
glioblastoma cell line T98G (ATCC strain No,, CRL-1690). Steps (ii), (iii) and (iv) are a series of steps for preparing 
cDNA library, and may be carried out in accordance with the method of Gubler & Hoffman {Gens, vol. 25, p. 263, 1 983) 
with a slight modification. As examples of the plasmid vector used in the step (iii), many vectors functioning in an £ 
constrain (e.g., pBR 322) and in a Bacillus subtilis (e.g., pUB 110) are known, and pVfCS-1 (described in detail in the 
following Example) prepared from pGEM-37f(+) (3,199 bp, manufactured by Promega Corp.) which functions in an £ 
col'u may be preferably used. As examples of host used in the step (iv), many cells are already known. Any cells may 
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be used, and DH5 competent cell which has been prepared in accordance with the method described in Gene, vol. 
96, p. 23, 1 990, may be preferably used. The cloning in the step (v) may be carried out by methods known per se and 
the sequencing may be carried out in accordance with the method of Maxam-Gilbert or the dideoxy termination method. 
The step (vi) may be carried out in accordance with the method described in Molecular Cloning (written by Sambrook, 
J., Fritsch, E. F. and Maniatis, T, published by Cold Spring Harbor Laboratory Press in 1989). 

As the following step, it is necessary to examine whether or not the DNA thus obtained codes right a secretory 
protein. The examination requires; 

(I) the conversion of the DNA sequence into the amino acid sequence in a possible frame, 

(II) the preparation of hydrophobic ity profile from the amino acid sequence thus obtained, followed by confirmation 
of the existence of a highly hydrophobic region just after the translation initiation codon (ATG) (secretory proteins 
have highly hydrophobic signal peptides on their N-termini), and then 

(III) the confirmation that the DNA thus obtained covers complete or almost complete length of intact mRNA. These 
confirmation may be carried out after the step (vi) hereinbefore described, and effectively between the step (v) 
and the step (vi). 

In the above examination, the step (II) may be carried out by analyzing using a computer making use of a hydro 
table of Kyte, J. and Doolittle, R. F., and an available software, e.g., DNASIS (Hitachi Software Engineering Co., Ltd.). 
The Step (III) may be carried out by northern analysis. 

Once the nucleotide sequences shown in SEQ ID NOs. 2, 4 and 5 are determined, DNA of the present invention 
may be obtained by chemical synthesis, by PGR method or by hybridization making use of a fragment of DNA of the 
present invention, as a probe. Furthermore, DNA of the present invention may be obtained in a desired amount by 
transforming with a vector DNA inserted a DNA of the present invention into a proper host, followed by culturing the 
transformant. 

The polypeptides of the present invention (shown in SEQ ID NOs. 1 and 3) may be prepared by: 

(1) isolating and purifying from an organism or a cultured cell, 

(2) chemically synthesizing, or 

(3) using a skill of biotechnology, preferabaly, by the method described in (3). 

Examples of expression system when preparing a polypeptide by using a skill of biotechnology, is, for example, 
the expression system of bacteria, yeast, insect ceil and mammalian cell. 

For example, the expression in £ co//may be carried out by adding the initiation codon (ATG) to 5' end of a DNA 
encoding a mature protein (e.g., DNA encoding a nucleotide sequence shown in SEQ ID NO. 2), connecting the DNA 
thus obtained to the downstream of a proper promoter (e.g., trp promoter, lac promoter, XP L promoter, T7 promoter 
etc.), and then inserting it intoa vector (e.g., pBR322, pUC18, pUC1 9 etc.) which functions in an £ constrain to prepare 
an expression vector. Then, an £ coli strain (e.g., £ colt DH1 strain, £ cofi JM109 strain, £ colt HB101 strain, etc.) 
which is transformed with the expression vector thus obtained may be cultured in a proper medium to obtain the desired 
polypeptide. When a signal peptide of bacteria (e.g., signal peptide of pel B) is utilized, the desired polypeptide may 
be also secreted in periplasm. Furthermore, a fusion protein with other polypeptide may be also produced easily. 

Furthermore, the expression in a mammalian ceil may be carried out, for example, by inserting the DNA shown in 
SEQ ID NO. 5 into the downstream of a proper promoter (e.g., SV40 promoter, LTR promoter, metallothionein promoter 
etc.) in a proper vector (e.g., retrovirus vector, papilloma virus vector, vaccinia virus vector, SV40 vector, etc.) to obtain 
an expression vector, and transforming a proper mammalian cell (e.g., monkey COS-7 cell, Chinese hamster CHO cell, 
mouse L cell etc.) with the expression vector thus obtained, and then culturing the transformant in a proper medium 
to get a desired polypeptide in the culture medium. The polypeptide thus obtained may be isolated and purified by 
conventional biochemical methods. 

The polypeptide of the present invention is synthesized in and secreted from a human glioblastoma cell line, and 
therefore, may possess biological activities relating to differentiation, proliferation and growth, of glias or neurons, 
relating to the function of immunity and relating to proliferation and growth, of tumors. The polypeptide of the present 
invention may be useful for the prevention of or in the treatment of aplasia or abnormal proliferation, of glias or neunons, 
depression or enhancement of immunological activity, or tumors etc. 

Further, polyclonal or monoclonal antibody against the polypeptide of the present invention can be used in the 
determination of the amount of the said polypeptide in organism, and thereby, may be utilized for the purpose of in- 
vestigating the relationship between the said polypeptide and diseases, or for the purpose of diagnosing diseases, and 
the like. Polyclonal and monoclonal antibody thereof may be prepared by conventional methods by using the'said 
polypeptide or the fragment thereof as an antigen. 

The DNA of the present invention may be utilized as an important and essential template in preparing the polypep- 
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tide of the present invention which is expected to possess various use or for diagnosis of and in the treatment of gene 
diseases (the treatment of gene defect disease and the treatment by inhibiting expression of the polypeptide by anti- 
sense DNA (RNA), and the like). Further, genomic DNA may be isolated by using the DNA of the present invention as 
a probe. Similarly it is possible to isolate genes having high homology to the DNA of the present invention in human 
or those of other species. 

Examples 

The following examples are illustrated, but not limit, the present invention. 
Example 1: Construction of vector for use in the preparation of cDNA library 

A plasmid vector, pGEM-3Zf(+) (3,ig9 bp, manufactured by Promega Corp.) was digested with H/ndlli, followed 
by Klenow treatment and circularized again. The necessary amount of this plasmid was recovered from culture of E 
colt transformed with the plasmid. Next, an Aatl\-Nde\ fragment was cut out from the plasmid, and the resulting linear 
plasmid fragment was smoothended using T4 polymerase. After ligating the thus treated termini with a HindlU linker 
the resulting fragment was digested with HindlU, circularized again and then transformed into an E. co/zstrain to recover 
plasmid. 

Thereafter, a Sad-Psfi portion in the polylinker of the plasmid was replaced by a synthetic polylinker shown below 

Noil 

Sat BgM KpnW I I 1 

I II II 1 

5 ' CGTCGACAGATCTGGTACCCGGGCGGCCGCACTAGTCTGCA 3 ' 

3 ' TCGAGCAGCTGTCTAGACCATGGGCCCGCCGGCGTGATCAG 5 ' 

The plasmid vector thus constructed (see Fig. 1) was named pVfCS-1. 

The pVfCS-1 has the following characteristic properties as a multi-purpose plasmid vector. 



1. Okayama-Berg method and Gubler-Hoffman method can be applied. 

2. Plasmid yield per cultured cells is high. 

3. Single-stranded DNA can be prepared. 

4. A cDNA insert can be cut out easily. 

35 5. Preparation of a deletion mutant for sequencing use can be made easily 

6. In vitro transcription can be made. 

Example 2: Isolation and purification of mRNA 



A total of 3 x 107 cells of a human g | iob , astoma cell line T98G (ATCC strain No., CRL-1690) were stimulated for 
4 hours wrth 100 units/ml of human IL-1p, subsequently isolating mRNA in accordance with the method of Okayama 
K et al (Methods in Enzymology, vol. 154, p. 3, 1987) as described below 

That is, the stimulated cells were solubilized with a 5.5 M GTC solution (5.5 M guanidine thiocyanate 25 mM 
sodium citrate, 0.5% sodium lauryl sarcosine), and the resulting cell lysate was laid on a cushion of a cesium trifluor- 
oacetate (CsTFA) solution having a density of 1 .51 and centrifuged (120,000 x g, 20 hours) to recover 1 26 mg of total 
RNA in the resulting pellet. Thereafter, the RNA sample was passed twice through an oligo(dT)-cellubse column to 
purify and recover 46 \lq of poly(A)+RNA. 

Example 3: Preparation of cDNA library 

A cDNA library was prepared in accordance with the method of Gubler & Hoffman (Gene, vol 25 d 263 19831 
with a slight modification. ' 

A first strand was synthesized from the poly(A)+ RNA (5 jxg) prepared in Example 2, using a reverse transcriptase 
and an oligo(oT) primer having a Atoll site. After synthesizing a second strand and carrying out Sah adaptor ligation 
and Not digestion, the adaptor and the primer were removed by gel filtration column chromatography using a column 
packed with Sephacryl S-500HR (available from Pharmacia), thereby recovering a fraction containing 820 ng of cDNA 
The above cDNA synthesizing step was effected making use of a kit (Super Script System, available from BRL) 
Separately from this, a vectorwas prepared by subjecting the pVfCS-1 obtained in Example 1 to complete digestion 
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with Non, digesting the product further with Sah, subjecting the resulting digest to 0.8% agarose gel electrophoresis 
to cut out a band of rnterest and then purifying the vector of interest making use of a kit for glass powder method use 
(GENECLEAN II, available from BIO 101). 

After subjecting the thus prepared cDNA and vector to ligation, the resulting product was transfected into DH5 
competent cells which have been prepared in accordance with the method of Inoue, H. et al (Gene, vol 96 p 23 
1990). As the results, a cDNA library containing 6 x 105 independent clones with an average length of 1 5 kb was 
obtained. 

Example 4 : Cloning and Sequencing 

Using the cDNA library prepared in Example 3, clones were plated on LB^roth agar containing ampicilin with a 
density of 300 colonies/dish having a diameter of 1 0 cm. Cloning was carried out by picking up the colonies at random 
Each of the colonies was cultured overnight in 3 ml of LB-broth. A 200 u.l portion of the resulting culture was mixed 
wrth dimethylsulfoxide (DMSO, final concentration of 7%) and stored at -80°C, and the remaining portion of the culture 
was used for the isolation of plasmid. The purification of plasmid was carried out by the usual way. 

Since the plasmid has a structure shown in Fig. 2, its nucleotide sequence can be read from the 5* end of the 
cloned cDNA when sequencing is carried out using T7 primer. 

DNA sequencing was carried out in accordance with a cycle sequence method based on the dideoxy termination 
method of Sanger, F. or al., using a fluorescence dye terminator of ABI (Applied Biosystems Inc.). Reading of the 
sequence was carried out using a DNA sequenser of ABI (Model 373A). 

In this way, a nucleotide sequence having a length of about 300 bases from 5' end of each cDNA was obtained. 

Example 5 : Analysis of partial sequence data 



Using the FASTA program of Lipman, D. J. and Pearson, W. Ft., the nucleotide sequence obtained in Example 4 
was searched for its homology with every nucleotide sequence contained in known data bases (GenBank and EMBL) 
As the results, a clone having an unknown sequence was identified. The unknown nucleotide sequence was converted 
into amino acid sequences in possible three frames. 

Next, hydrophobic and hydrophilic profiles of the amino acid sequences deduced from their corresponding three 
frames were analyzed graphically using a computer making use of a hydro table of Kyte, J. and Doolittle, R F In this 
instance, DNASIS (Hitachi Software Engineering Co., Ltd.) was used as a computer software. 

It is known that most of cytokines and growth factors, as well as a number of membrane proteins, have signal 
peptides on their N-termmi. Signal peptide is a highly hydrophobic region immediately downstream of the translation 
initiation ammo acid Met. In a nucleotide sequence, it is located just after the initiation codon (ATG) of its coding region 
In consequence, a cDNA having a putative signal peptide^ontaining protein was screened on the basis of data on the 
open reading frame of the clone having unknown sequence and on the hydrophobic and hydrophilic properties of the 
amino acid sequence. As the results, a clone of interest (clone No. TG0077) was found which showed signal peptide- 
specific nature in one of its three possible amino acid sequence frames. (See Fig. 3 which shows a hydrophobicity 
profile of the total amino acid sequence. In the figure, the V side is a highly hydrophobic region and the ■-■ side is a 
40 region having weak hydrophobicity.) 

It is possible however that the cloned cDNA does not cover complete length of mRNA. If it does not cover the 
complete length, it is less possible that the clone contains the N-terminal amino acid sequence moiety. 

In consequence, northern analysis was carried out in order to determine if the clone TG0077 has complete length 
or not. That is, poly(A)+RNA which has been extracted and purified from a glioblastoma cell line was subjected to 
electrophoresis and then blotting on nylon membrane. When hybridization was carried out using a TG0077 cDNA insert 
as a probe, a single band was found at a position corresponding to about 700 bp. Since the TG0077 cDNA insert had 
a size of about 700 bp, it was confirmed that the TG0077 clone was a almost complete length cDNA. 

Example 6: Determination of complete sequence of the cDNA and open reading frame 

The complete length cDNA sequence was determined by means of random sequencing in accordance with the 
method disclosed in Molecular Cloning (Sambrook, J., Fritsch, E. F. and Maniatis, T, Cold Spring Harbor Laboratory 
Press, 1 989). 

Plasmid was recovered from the TG0077 clone to isolate and purify a cDNA insert. The insert thus purified was 
subjected to ligation and fragmentation, followed by smooth-ending of both termini of the resulting DNA fragment with 
T4 polymerase, thereby recovering a DNA fragment of about 400 bp in length by agarose gel electrophoresis The 
thus obtained DNA fragment was cloned into a Smal site of a plasmid vector BLUESCRIPT II (available from Strata- 
gene) and then transfected into an £ coli strain. A total of 20 colonies were picked up at random to prepare 20 corre- 
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spond.ng plasmid DNA samples (every of them contained a TG0077 cDNA fragment as an insert) which were subse- 
quently subjected to DNA sequencing. DNA sequencing and sequence reading were carried out in the same manner 
as the procedure described in Example 4. Sequence data of the TG0077 cDNA fragment were arranged into a continued 
sequence making use of a DNA sequence connection program of DNASIS, thereby obtaining a base sequence shown 
in SEQ ID NO. 5. An open reading frame was determined based on the complete length cDNA sequence data todeduce 
corresponding amino acid sequence, with the results shown in SEQ ID NO. 3. 

Example 7 : Construction of plasmid vector for use in the preparation of expression vector 

A vector, called pcD-SRa296. constructed by Takebe et al. (Mol. Cell. Biol., vol. 8, p. 966, 1988) is an excellent 
expression vector for use in animal cells, which contains a promoter system (SRo) consisting of SV40 earfy promoter. 

region of LTR of HTLV -' and a P art of us sequence. This vector, however, has disadvantages in that (1 ) it has only' 
one insert-cloning site, namely an EcoRI site, and (2) its recovery yield from E. coil is small because of the use of 
pBR322 vector as its basic structure. In consequence, an attempt was made to modify the P cD-SRo296 vector in the 
following manner, with the aim of constructing a modified vector containing multiple insert-cloning sites and havinq a 
basic structure derived from pUC19 which is effective in recovering the resulting modified vector in a high yield from 

E. COlt. 

The pcD-SRa296 vector (provided by Dr. Takebe at National Institute of Health) was digested with Sail and the 
resulting digest was subjected to agarose gel electrophoresis to separate and recover a 1 .7 kbp fragment containing 
the SRa promoter, subsequently smooth-ending the thus obtained fragment by Klenow treatment 

Separately from this, pUC1 9 vector was digested with A/cfel and W/ndlll and the resulting digest was subjected to 
agarose gel electrophoresis to separate and recover a 2.4 kbp fragment containing Amry and pUCori regions subse- 
quently smooth-ending the thus obtained fragment by Klenow treatment and thereafter removing the 5'-end phosphoric 
acid group by BAP (bacterial alkaline phosphatase) treatment. 

The thus prepared 1 .7 kbp fragment containing the SRa promoter was ligated with the pUCori-containing 2 4 kbp 
fragment to make them into a circular form, thereby effecting construction of a new vector. A Pst-Kprt fragment was 
removed from the thus obtained vector and replaced with the following synthetic polylinker. 

, Sad San BaHl fcoRI nm Swal 
5 . 1 11 " « 1 II 1 

^ ^ GGAGCTCGTCGACAGATCTGAATTCCATATGCCCGGG 
ACGTCCTCGAGCAGCTGTCTAGACTTAAGGTATACGGGCCC 

No* Spa 

r 1 1 

GCGGCCGCACTAGTGGTAC 3 ' 
CGCCGGCGTGATCAC S ' 

The plasmid vector constructed in this manner (about 3.9 kbp, see Fig. 4) was named pUC-SRaMU. 

Example 8: Construction of expression vector 

A cDNA insert was cut out using Sa» and Spel from plasmid which has been isolated from the TG0077 clone 
prepared in Example 5, and the cDNA insert was inserted into a Sal -Spel site of the pUC-SRoML1 obtained in Example 
7, thereby obtaining an expression vector, pUC-SRaML1 -TG0077A. 

Separately from this, oligodeoxynucleotides represented by the following formulae: 



TAGAGAGTCGACCACGTGGCTGCCACCCAGGCCTCTCTGCTC 



and 



AAGAGAACTAGTTCAATGATGATGATGATGATGTGGCAGG 
ACTGTG ATAG AG TCAGAAACTGA 3 " 

were synthesized, and the TG0077 plasmid was subjected to PCR (polymerase chain reaction) using the thus synthe- 
s.zed oligodeoxynucleotides as primers. The thus obtained PCR fragment contains cDNA which encodes a protein 
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molecule consisting of the TG0077 protein and six additional histidine (His) residues attached to its C-terminal end 
The PCR fragment was digested with Sah and Spei, and the resulting digest was separated and purified and then 
inserted into the Sarf-Spe! site of the pUC-SRccML1 prepared in Example 7 to obtain an expression vector pUC- 
oHaMLi-TG0077B. 

Each of the thus constructed pUC-SRccML1 -TG0077A and P UC-SR«ML1 -TG0077B plasmids was transfected into 
an £ coll strain DH5, recovered from a 100 ml culture of the resulting transformant and then purified by CsCI density 
gradient centnfugation twice. In this manner, 592 ug and 546 ug of the respective plasmids were recovered. 

Example 9 . Expression in COS cells 

Each of the plasmid DNA preparations pUC-SRctML1 , pUC-SRoMLI -TG0077A and pl)C-SRaML1 -TG0077B was 
introduced into COS-7 cells (Cell, vol. 23, p. 175, 1981) by means of the diethylaminoethyl (DEAE) dextran method 
(J. Immunology, vol. 136, p. 4291, 1986). 

That is about 1 .8 x 10» COS-7 cells were inoculated into a 225 err* capacity flask (manufactured by Coming) 
toge her -with 50 ml of a liquid culture medium (Dulbecco's modified MEM medium supplemented with 10% decomple- 
mented fetal bovine serum). After overnight incubation in a carbon dioxide incubator (37"C, 5% CO,) and subsequent 
removal of the culture supernatant, 1 2 ml of a DNA cocktail (Dulbecco's modified MEM medium supplemented with 1 5 
ug of each plasmid DNA, 50 mM Tris-HCI buffer (pH 7.4) and 400 ug/ml of DEAE-dextran) was added to each flask 
and culture was carried out for 3 hours at 37'C in an atmosphere of 5% CO a . Thereafter, the DNA cocktail was replaced 
by 15 ml of achloroquine solution (Dulbecco's modified MEM medium supplemented with 1 50 nM chloroquine and 7% 
decomplemented fetal bovine serum), followed by additional 3 hours of cuflure. 

After removing the chloroquine solution, the aforementioned liquid culture medium (50 ml) was added to each of 
the resulting flasks which were then incubated at 37'C in an atmosphere of 5% CO a for 72 hours to find growth of the 
cells in each flask into almost monolayer form. After removing the culture supernatant, the cells in each flask were 
washed with a serum-free liquid culture medium (trade name, SFM-101; available from Nissui Pharmaceutical Co 
Ltd. and then supplied with 75 ml of the same serum-free liquid culture medium, and the culturing was continued for 
another 72 hours. Thereafter, the resulting culture supernatant was recovered and filtered through a membrane fitter 
(trade name. STERIVEX-GS; available from Miiiipore Corp.) to remove the cells and cell debris. The thus obtained 
culture supernatant samples were stored at 4'C for future use. A culture supernatant of COS cells which have been 
transformed with plasmid containing the TG0077 cDNA insert is expected to contain expressed and secreted mature 
protein moiety of a polypeptide which corresponds to TG0077. 

Example 10 : Confirmation of expression 



A 2 ml portion of each of theculture supernatants of transformed COS cells obtained in Example 9 was concentrated 
to a volume of 100 ul using a centrifugal concentration filter (trade name, Centricon-1 0; available from Miiiipore Corp ) 
A 1 Hi portion of each of the thus concentrated samples was mixed with the same volume of a loading buffer (0 125 M 
Tris-HCI buffer (pH 6.8), 4% sodium dodecyl sulfate and 30% glycerol) for SDS-PAGE (sodium dodecyl sulfate pofv- 
40 Sge eleCtr0ph0resis) use ' and the mixture was ,reate d at 90»C for 3 minutes and then subjected to SDS- 

In the case of the TG0077B protein having His hexamer introduced to the C-terminus of the protein not only its 
corresponding COS cell culture supernatant but also its purified product were subjected to the SDS-PAGE analysis 

Purification of the protein was carried out by means of a metal chelate affinity chromatography (Biotechnology, 
vol. 9, p. 273. 1 991 ), making use of the function of His to form complex compounds with various transition metal tons 
That is, a culture supernatant (350 ml) obtained from COS cells was mixed with a sodium chloride aqueous solution 
in such an amount that the final concentration of the salt became 1 M, and the resulting mixture was applied to a column 
packed wrth 4 ml of a zinc-linked chelating Sepharose (trade name, Chelating Sepharose Fast-Flow; available from 
Pharmacia) to adsorb the protein to the resin. The column was washed with 50 mM phosphate buffer (pH 7 0) containing 
1 M sodium chloride aqueous solution (40 ml), and the protein retained in the column was eluted with 50 mM phosphate 
buffer (pH 7.0) containing 1 M sodium chloride aqueous solution and 0.4 M imidazole. Thereafter, 1 2 ml of the resuftinq 
eluate was concentrated to a volume of 100 ul, and a 1 uJ portion of the concentrated sample was subjected to SDS- 
PAGE analysis. 

/on J?n S i™'c P no ™ ' ySi t WaS Carried ° Ut maki " 9 USe of a Phast Sys,em bailable from Pharmacia) using a gel 
(Phast Gel™ SDS 1 0/20 gradient, available from Pharmacia). Results of silver staining are shown in Fig 5 

In the figure, lane 1 indicates a concentrated culture supernatant of COS cells transformed with pUC-SRoMLI 
lane 2 indicates a concentrated culture supernatant of COS cells transformed with pUC-SRoML1 -TG0077B lane 3 
indicates a non-adsorbed fraction obtained by the column purification of the culture supernatant of COS cells trans- 
formed with pUC-SRoMLI -TG0077B, lane 4 indicates a fraction eluted with imidazole at the time of the same column 
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purification, lane 5 indicates a concentrated culture supernatant of COS cells transformed with pUC-SRaML1 -TG0077A 
and lane M represents molecular weight markers. 

A band which corresponds to a molecular weight of about 1 5 to 16 kD was found on the lanes 2, 4 and 5 but not 
on the lanes 1 and a This band was found to be originated from the expressed TG0077 protein, because its position 
coincided wrth a molecular weight calculated from the amino acid sequence of the TG0077 protein. Recovered amount 
of the protein purified by the metal chelate affinity column chromatography was estimated to be about 20 ug based on 
the comparison of the silver-stained SDS-PAGE band with a standard band of human IL-4. 

Example 11: Amino acid sequencing of expressed protein 

Amino acid sequence of the purified protein obtained by the metal chelate affinity column chromatography was 
determined starting from its N-terminal end, using a gas phase sequenser in accordance with the method of Matsudaira 
etai. (J. Biol Chem., vol. 262, p. 10035, 1987). 

Firstly, approximately 100 pmol of TG0077B protein (purified product obtained in Example 10) was subjected to 1 
hour of SDS-PAGE at 60 mA using 1 0 to 20% polyacrylamide gradient gel (available from Daiichi Pure Chemical Co 
Ltd.). After completion of the electrophoresis, the resulting gel was equilibrated with a blotting buffer [10 mM CAPS 
(3-cyclohexylammo-1 -propane sulfonate) and 10% methanol] and then subjected to 2 hours of electro-blotting at 0 4 
A, using a blotting apparatus (available from Marisol), on a PVDF (polyvinylidene difluoride) membrane (trade name 
Pro Blott; available from A8I) which has been equilibrated with the same blotting buffer. After completion of the blotting' 
the resulting PVDF membrane was washed three times with distilled water, stained for 1 minute with 0 1% (w/v) 
Coomassie Blue and then decolorized by 50% methanol. Thereafter, a band of interest was cut out and applied to a 
protein sequenser (Model 470A, equipped with an on-line PTH analyzer Model 120A; available from ABI). 

As the results, the sequence from the N-terminus was determined as follows. 

Gly-X r Pro-Pro-X 2 -Gln-X 3 -Ser-X 4 



(In the above formula, X, to X4 indicate that respective amino acid residues were not able to be judged clearly on the 
sequenser chart.). This sequence coincided with a downstream portion, starting at the 24 position amino acid residue 
of the amino acid sequence deduced from the cDNA sequence. Based on these results, it was confirmed that the 
TG0077 protein has a signal peptide of 23 amino acid residues on its N-terminal side and the N-terminus of the mature 
protein is glycine (Gly). 

Complete nucleotide sequence of the TG0077 cDNA and primary amino acid sequence of the TG0077 protein 
encoded by the nucleotide sequence are shown in SEQ ID NO. 6. 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: 



(A) NAME: Ono Pharmaceutical Co Ltd 

(B) STREET: 1-5 Doshomachi 2-Chome 

(C) CITY: Osaka 

(E) COUNTRY: Japan 

(F) POSTAL CODE (ZIP): 541 



(ii) TITLE OF INVENTION: Novel Polypeptides and DNAs Encoding Them 

(iii) NUMBER OF SEQUENCES: 10 

(iv) COMPUTER READABLE FORM: 



(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

(v) CURRENT APPUCATION DATA: 



APPLICATION NUMBER: EP 93301571.1 
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(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..399 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GGA AGC CCT CCG ACC CAG CCC TCG CCG GCC TCG GAT TCC GGC TCT GGC 
Gly Ser Pro Pro Thr Gin Pro Ser Pro Ala Ser Asp Ser Gly Ser Gly 
15 10 15 

TAC GTT CCG GGC TCG _w x^r GCA GCC TTT GTT ACT TGC CCC AAC GAG 
Tyr Val Pro Gly Ser Val Ser Ala Ala Phe Val Thr Cys Pro Asn Glu 
20 25 30 

AAG GTC GCC AAG GAG ATC GCC AGG GCC GTG GTG GAG AAG CGC CTA GCA 
Lys Val Ala Lys Glu lie Ala Arg Ala Val Val Glu Lys Arg Leu Ala 
35 40 45 

GCC TGC GTC AAC CTC ATC CCT CAG ATT ACA TCC ATC TAT GAG TGG AAA 
Ala Cys Val Asn Leu lie Pro Gin lie Thr Ser lie Tyr Glu Trp Lys 
50 55 60 

GGG AAG ATC GAG GAA GAC AGT GAG GTG CTG ATG ATG ATT AAA ACC CAA 
Gly Lys lie Glu Glu Asp Ser Glu Val Leu Met Met lie Lys Thr Gin 
65 70 75 80 



AGT TCC TTG GTC CCA GCT TTG ACA GAT TTT GTT CGT TCT ' CiTG )ZAC CCT. 
Ser Ser Leu Val Pro Ala Leu Thr Asp Phe Val Arg Ser Val His Pvo 
85 90 95" ' 

TAC GAA GTG GCC GAG GTA ATT GCA TTG CCT GTG GAA CAG GGG AAC TTT 
Tyr Glu Val Ala Glu Val He Ala Leu Pro Val Glu Gin Gly Asn Phe 
100 105 110 

CCG TAC CTG CAG TGG GTG CGC CAG GTC ACA GAG TCA GTT TCT GAC TCT 
Pro Tyr Leu Gin Trp Val Arg Gin Val Thr Glu Ser Val Ser Asp Ser 
115 120 125 



ATC ACA GTC CTG CCA 
He Thr Val Leu Pro 
130 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Gly Ser Pro Pro Thr Gin Pro Ser Pro Ala Ser Asp Ser Gly Ser Gly 
1 5 10 15 

Tyr Val Pro Gly Ser Val Ser Ala Ala Phe Val Thr Cys Pro Asn Glu 
20 25 30 

Lys Val Ala Lys Glu He Ala Arg Ala Val Val Glu Lys Arg Leu Ala' 
35 40 45 

Ala Cys Val Asn Leu He Pro Gin He Thr Ser He Tyr Glu Trp Lys 
50 55 60 

Gly Lys lie Glu Glu Asp Ser Glu Val Leu Met Met He Lys Thr Gin 
65 70 75 80 

Ser Ser Leu Val Pro Ala Leu Thr Asp Phe Val Arg Ser Val His Pro 

85 90 95 

Tyr Glu Val Ala Glu Val He Ala Leu Pro Val Glu Gin Gly Asn Phe 
100 105 no 

Pro Tyr Leu Gin Trp Val Arg Gin Val Thr Glu Ser Val Ser Asp Ser 
115 120 125 

He Thr Val Leu Pro 
130 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..468 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
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ATG CCG GCG CTG CTG CCT GTG GCC TCC CGC CTT TTG TTG CTA CCC CGA 
Met Pro Ala Leu Leu Pro Val Ala Ser Arg Leu Leu Leu Leu Pro Arg 
15 10 15 



48 



GTC TTG CTG ACC ATG GCC TCT GGA AGC CCT CCG ACC CAG CCC TCG CCG 
Val Leu Leu Thr Met Ala Ser Gly Ser Pro Pro Thr Gin Pro Ser Pro 
20 25 30 



96 



10 



GCC TCG GAT TCC GGC TCT GGC TAC GTT CCG GGC TCG GTC TCT GCA GCC 
Ala Ser Asp Ser Gly Ser Gly Tyr Val Pro Gly Ser Val Ser Ala Ala 
35 40 45 



144 



1S 



20 



25 



TTT GTT ACT TGC CCC AAC GAG AAG GTC GCC AAG GAG ATC GCC AGG GCC 192 
Phe Val Thr Cys Pro Asn Glu Lys Val Ala Lys Glu He Ala Arg Ala 
50 55 60 

GTG GTG GAG AAG CGC CTA GCA GCC TGC GTC AAC CTC ATC CCT CAG ATT 24 0 

Val Val Glu Lys Arg Leu Ala Ala Cys Val Asn Leu He Pro Gin He 
65 70 75 80 

ACA TCC ATC TAT GAG TGG AAA GGG AAG ATC GAG GAA GAC AGT GAG GTG 288 
Thr Ser lie Tyr Glu Trp Lys Gly Lys He Glu Glu Asp Ser Glu Val 
85 90 95 

CTG ATG ATG ATT AAA ACC CAA AGT TCC TTG GTC CCA GCT TTG ACA GAT 336 
Leu Met Met He Lys Thr Gin Ser Ser Leu Val Pro Ala Leu Thr Asp 
100 105 no 



30 



35 



TTT GTT CGT TCT GTG CAC CCT TAC GAA GTG GCC GAG GTA ATT GCA TTG 384 
Phe Val Arg Ser Val His Pro Tyr Glu Val Ala Glu Val He Ala Leu 
115 120 125 

CCT GTG GAA CAG GGG AAC TTT CCG TAC CTG CAG TGG GTG CGC CAG GTC 432 
Pro Val Glu Gin Gly Asn Phe Pro Tyr Leu Gin Trp Val Arg Gin Val 
130 135 140 

ACA GAG TCA GTT TCT GAC TCT ATC ACA GTC CTG CCA 468 
Thr Glu Ser Val Ser Asp Ser He Thr Val Leu Pro 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 4: 
4° (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 56 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



45 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Pro Ala Leu Leu Pro Val Ala Ser Arg Leu Leu Leu Leu Pro Ara 
50 1 5 10 15 



55 
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Val Leu Leu Thr Met Ala Ser Gly Ser Pro Pro Thr Gin .Pro *Ser Pro 
20 25 * 3p' 

Ala Ser Asp Ser Gly Ser Gly Tyr Val Pro Gly Ser Val Ser Ala Ala 
35 40 45 

Phe Val Thr Cys Pro Asn Glu Lys Val Ala Lys Glu He Ala Arg Ala 
50 55 60 

Val Val Glu Lys Arg Leu Ala Ala Cys Val Asn Leu He Pro Gin He 
65 70 75 80 

Thr Ser He Tyr Glu Trp Lys Gly Lys He Glu Glu Asp Ser Glu Val 

85 90 95 

Leu Met Met He Lys Thr Gin Ser Ser Leu Val Pro Ala Leu Thr Asp 
100 105 no 

Phe Val Arg Ser Val His Pro Tyr Glu Val Ala Glu Val He Ala Leu 
115 120 125 

Pro Val Glu Gin Gly Asn Phe Pro Tyr Leu Gin Trp Val Arg Gin Val 
130 135 140 

Thr Glu Ser Val Ser Asp Ser He Thr Val Leu Pro 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 67.. 537 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 5: 
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10 



TGGTGCGCGC TGTTTTCTAA TCACGTGGCT GCCACCCAGG CCTCTCTGCT CCTGTCTTTT 60 

GTTTGG ATG CCG GCG CTG CTG CCT GTG GCC TCC CGC CTT TTG TTG CTA 108 
Met Pro Ala Leu Leu Pro Val Ala Ser Arg Leu Leu Leu Leu 
15 10 

CCC CGA GTC TTG CTG ACC ATG GCC TCT GGA AGC CCT CCG ACC CAG CCC 156 
Pro Arg Val Leu Leu Thr Met Ala Ser Gly Ser Pro Pro Thr Gin Pro 
15 20 25 30 

TCG CCG GCC TCG GAT TCC GGC TCT GGC TAC GTT CCG GGC TCG GTC TCT 204 
Ser Pro Ala Ser Asp Ser Gly Ser Gly Tyr Val Pro Gly Ser Val Ser 
35 40 45 

GCA GCC TTT GTT ACT TGC CCC AAC GAG AAG GTC GCC AAG GAG ATC GCC 252 
, 5 Ala Ala Phe Val Thr Cys Pro Asn Glu Lys Val Ala Lys Glu He Ala 
50 55 60 

AGG GCC GTG GTG GAG AAG CGC CTA GCA GCC TGC GTC AAC CTC ATC CCT 300 

20 Arg Ala Val Val Glu Lys Arg Leu Ala Ala Cys Val Asn" ;L6u ; life p±o \ 
65 70 75 i 

CAG ATT ACA TCC ATC TAT GAG TGG AAA GGG AAG ATC GAG GAA GAC AGT 348 
Gin He Thr Ser He Tyr Glu Trp Lys Gly Lys He Glu Glu Asp Ser 
2s 80 85 90 

GAG GTG CTG ATG ATG ATT AAA ACC CAA AGT TCC TTG GTC CCA GCT TTG 396 
Glu Val Leu Met Met He Lys Thr Gin Ser Ser Leu Val Pro Ala Leu 
95 100 105 110 

30 ACA GAT TTT GTT CGT TCT GTG CAC CCT TAC GAA GTG GCC GAG GTA ATT 444 

Thr Asp Phe Val Arg Ser Val His Pro Tyr Glu Val Ala Glu Val He 
115 120 125 



35 



40 



GCA TTG CCT GTG GAA CAG GGG AAC TTT CCG TAC CTG CAG TGG GTG CGC 492 
Ala Leu Pro Val Glu Gin Gly Asn Phe Pro Tyr Leu Gin Trp Val Arg 
130 135 140 

CAG GTC ACA GAG TCA GTT TCT GAC TCT ATC ACA GTC CTG CCA TGATGAGCCC 544 
Gin Val Thr Glu Ser Val Ser Asp Ser He Thr Val Leu Pro 
145 150 155 

TGTTCCTGCT CATCATGAAG ATCCCCGCGA TACTTCAACG CCTTCTGACT TCCAGGTGAT 6 04 

GACTGGGCCC CCAATAAATC CCGTCTTTGG GTCAAAAAAA AAAAAAAAAA AAAAAA 660 

45 (2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO' 6' 

ss 
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Met Pro Ala Leu Leu Pro Val Ala Ser Arg Leu Leu Leu Leu Pro Arg 
1 5 io is 

Val Leu Leu Thr Met Ala Ser Gly Ser Pro Pro Thr Gin Pro Ser Pro 
20 25 30 

Ala Ser Asp Ser Gly Ser Gly Tyr Val Pro Gly Ser Val Ser Ala Ala 
35 40 45 

Phe Val Thr Cys Pro Asn Glu Lys Val Ala Lys Glu He Ala Arg Ala 
50 55 60 

Val Val Glu Lys Arg Leu Ala Ala Cys Val Asn Leu He Pro Gin He 
g 5 70 75 80 

Thr Ser He Tyr Glu Trp Lys Gly Lys He Glu Glu Asp Ser Glu Val 

85 90 95 

Leu Met Met He Lys Thr Gin Ser Ser Leu Val Pro Ala Leu Thr Asp 
100 105 no 

Phe Val Arg Ser Val His Pro Tyr Glu Val Ala Glu Val He Ala Leu 
115 120 125 

Pro Val Glu Gin Gly Asn Phe Pro Tyr Leu Gin Trp Val Arg Gin Val 

130 135 140 

Thr Glu Ser Val Ser Asp Ser He Thr Val Leu Pro 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION: 1..45 

(D) OTHER INFORMATION: /note= This is a polylinker and has a 4-bp single stranded overhang at each 
end. Double stranded only from 5-41 ■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
AGCTCGTCGA CAGATCTGGT ACCCCGGCGG CCGCACTAGT CTGCA 45 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: miscjeature 

(B) LOCATION: 1..60 

z'ss "zrti;.* * a <*— - ■ «• **• — — . - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8; 
TGCAGGAGCT CGTCGACAGA TCTGAATTCC ATATGCCCGG GGCGGCCGCA CTAGTGGTAC 60 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TAG AG AGT CG ACCACGTGGC TGCCACCCAG GCCTCTCTGC TC 

42 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

AAGAGAACTA GTTCAATGAT GATGATGATG ATGTGGCAGG ACTGTGATAG AGTCAGAAAC 60 
TGA 



63 



Claims 



1. Apolypep^^^ 1 m substantially purified fofm ahomoloque 
hereof having at least 90% homobgy with the amino acid sequence shown in SEQ ID No 1 oZ ' it£to ofat 

52SEE£ h ° "* °* s - me - • - - ■ «-S125S: 

2. A polypeptide according to claim 1 having the amino acid sequence shown in SEQ ID No. 1 . 

3. A polypeptide comprising a polypeptide according to claim 1 or 2, having the amino acid sequence shown in SEQ 
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ID No. 3 in substantially purified form, a homologue thereof having at least 90% homology with the amino acid 
sequence shown in SEQ ID No. 3 over a region of at least SOcontiguous amino acids, or a fragment of the sequence 
or a fragment of such a homologue comprising at least 20 amino acids. 

4. A polypeptide according to claim 3 having the amino acid sequence shown in SEQ ID No. 3. 

5. DNA encoding a polypeptide according to any one of the preceding claims. 

6 " !™ Aa ^^ 1 orafragmentthereofcomprising 

at least 20 nucleotides capable of selectively hybridizing to SEQ ID No. 1 

7. DNA according to claim 5 or 6 having the nucleotide sequence shown in SEQ ID No. 3 or a fragment thereof 
compnsing at least 20 nucleotides capable of selectively hybridizing to SEQ ID No. 3. 

8. DNA according to claim 5, 6 or 7 having the nucleotide sequence shown in SEQ ID No. 5 or a fragment thereof 
compnsing at least 20 nucleotides capable of selectively hybridizing to SEQ ID No. 5. 

9. A replication and expression vector comprising DNA according to any one of claims 4 to 8. 

10. Host cells transformed or transfected with a replication and expression vector according to claim 9. 

11. A method of producing a polypeptide which comprises culturing host cells according to claim 10 under conditions 
effective to express a polypeptide according to any one of claims 1 to 4. 

12. A monoclonal or polyclonal antibody to a polypeptide according to any one of claims 1 to 4. 

13. A pharmacuetical composition containing a polypeptide according to any one of claims 1 to 4 or an antibody ac- 
cording to claim 1 2 in association with a pharmacuetically acceptible deluant of carrier. 

Patentanspruche 

1. Polypeptid mit der in der Sequenz mit der Identifizierungsnummer 1 dargestellten Aminosauresequenz in im we- 
sentl.chen gerein.gter Form, ein Homologes hiervon mit einer mindestens 90 % Homologie mit der in der Sequenz 

KK r i A •' erUn9SnUmmer 1 dar 9 estell,en Aminosauresequenz uber einen Bereich von mindestens 30 be- 
nachbarten Aminosauren oder ein Fragment der Sequenz Oder ein Fragment eines derartigen Homologen, das 
mindestens 20 Aminosauren umfaBt. 

2. Polypeptid nach Anspruch 1 mit der in der Sequenz mit der Identifizierungsnummer 1 dargestellten Aminosaure- 

sequenz. 

3 ' n™ Jh^' 38 ? T 6 T POlyPeP,id " aCh AnSprUCh 1 °* r 2 ' mit der in der Se " uen2 mit de ' Identifizierungs- 
nummer 3 dargestelrten Aminosauresequenz in im wesentlichen gereinigter Form, ein Homologes hiervon mit 
e.ner mmdestens 90 % Homologie mit der in der Sequenz mit der Identifizierungsnummer 3 dargestellten Amino- 
sauresequenz uber einen Bereich von mindestens 30 benachbarten Aminosauren oder ein Fragment der Sequenz 
oder ein Fragment eines derartigen Homologen, das mindestens 20 Aminosauren umfaBt. 

4. Polypeptid nach Anspruch 3 mit der in der Sequenz mit der Identifizierungsnummer 3 dargestellten Aminosaure- 

sequenz. 

5. DNA mit Kodierung fur ein Polypeptid nach einem der vorhergehenden Anspruche. 

6. DNA nach Anspruch 5 mit der in der Sequenz mit der Identifizierungsnummer 1 dargestellten Nukleotidsequenz 
Oder em Fragment h.ervon, das mindestens 20 NukleotkJe mit der Fahigkeit zur selektK/en Hybridisierung an die 
bequenz mit der Identifizierungsnummer 1 umfaBt. 

7. DNA nach Anspruch 5 oder 6 mit der in der Sequenz mit der Identifizierungsnummer 3 dargestellten Nukleotidse- 
quenz oder ein Fragment hieivon, das mindestens 20 Nukleotide mit der Fahigkeit zur selektiven Hybridisierung 
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an die Sequenz mit der Identifizierungsnummer 3 umfaBt. 

8. DNA nach Anspruch 5, 6 Oder 7 mit der in der Sequenz mit der Identifizierungsnummer 5 dargestellten Nukleo- 
tidsequenz Oder em Fragment hiervon, das mindestens 20 Nukleotide mit der Fahigkeit zur selektiven Hybridisie- 
rung an die Sequenz mit der Identifizierungsnummer 5 umfaBt. 

9. Replikations- und Expressionsvektor, der DNA nach einem der Anspruche 4 bis 8 umfaBt. 

10. Mit einem Replikations- und Expressionsvektor nach Anspruch 9 transformierte Oder transfizierte Wirtzellen. 

11. Verfahren zur Herstellung eines Poypeptids durch Zuchten von Wirtzellen nach Anspruch 10 unter Bedingunqen 
die zur Expression eines Polypeptids nach einem der Anspruche 1 bis 4 fuhren. ' 

12. Monoklonaler oder polyklonaler Antikorper zu dem Polypeptid nach einem der Anspruche 1 bis 4. 

13. Pharmazeutische Zubereitung, die ein Polypeptid nach einem der Anspruche 1 bis 4 oder einen Antikorper nach 
Anspruch 12 in Verbindung mit einem pharmazeutisch akzeptablen Verdunnungsmittel oder Trager enthalt. 

Revendications 

1. Polypeptide ayant la sequence d'acides amines presentee dans SEQ ID N". 1 sous forme pratiquement purifiee 
homologue de celui-ci ayant au moins 90 % d'homologie avec la sequence d'acides amines presentee dans SEQ 
ID N 1 sur une region d'au moins 30 acides amines contigus, ou fragment de la sequence, ou fragment de cet 
homologue, comprenant au moins 20 acides amines. 

2. Polypeptide conforme a la revendication 1 , ayant la sequence d'acides amines presentee dans SEQ ID NM . 

3. Polypeptide comprenant un polypeptide conforme a la revendication 1 ou la revendication 2, ayant la sequence 
d'acides ammes presentee dans SEQ ID N°. 3 sous forme pratiquement purifiee, homologue de celui-ci ayant au 
moms 90 % d'homologie avec la sequence d'acides amines presentee dans SEQ ID N° 3 sur une region d'au 
moms 30 acides amines contigus, ou fragment de la sequence, ou fragment de cet homologue, comprenant au 
moms 20 acides amines. y 

4. Polypeptide conforme a la revendication 3, ayant la sequence d'acides amines presentee dans SEQ ID N°. 3. 

5. ADN codant pour un polypeptide conforme a I'une quelconque des revendications precedentes. 

6. ADN conforme a la revendication 5, ayant la sequence nucleotidique presentee dans SEQ ID N» 1 ou fragment 
de celui-ci, comprenant au moins 20 nucleotides capables de s'hybrider selectivement a SEQ ID N°. 1. 

7. ADN conforme a la revendication 5 ou la revendication 6, ayant la sequence nucleotidique presentee dans SEQ 
ID N». 3 ou fragment de celui-ci comprenant au moins 20 nucleotides, capable de s'hybrider selectivement a SEQ 

8. ADN conforme a la revendication 5, la revendication 6 ou la revendication 7, ayant la sequence nucleotidique 
presentee dans SEQ ID N». 5 ou fragment de celui-ci comprenant au moins 20 nucleotides, capable de s'hybrider 
selectivement a SEQ ID N". 5. 1 

9. Vecteur de replication et depression comprenant de I' ADN conforme a I'une quelconque des revendications 4 a 8. 

10. Cellule h6te transformee ou transferee par un vecteur de replication et d'expression conforme a la revendication 9. 

11. Precede de production d'un polypeptide qui comprend la culture de cellules holes conformes a la revendication 
10 dans des conditions permettant ('expression d'un polypeptide conforme a I'une quelconque des revendications 

12. Anticorps monoclonal ou polyclonal contre un polypeptide conforme a I'une quelconque des revendications 1 a 4. 
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13. Composition pharmaceutique contenant un polypeptide conforms a Tune quelconque des revendications 1 a 4 ou 
anKcorps conforme a la revendication 12 en association avec un diluant ou un vehicule acceptable du point de 
vue pharmaceutique. H 
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Figure 7 
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figure 2 
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figure 5 
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figure 4 
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